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We consider the adiabatic quantum algorithm for systems with "no sign problem" , such as the 
transverse field Ising mode, and analyze the equilibration time for quantum Monte Carlo (QMC) 
on these systems. We ask: if the spectral gap is only inverse polynomially small, will equilibration 
methods based on slowly changing the Hamiltonian parameters in the QMC simulation succeed in 
a polynomial time? We show that this is not true, by constructing counter-examples. In some 
examples, the space of configurations where the wavefunction has non-negligible amplitude has a 
nontrivial fundamental group, causing the space of trajectories in imaginary time to break into dis- 
connected components with only negligible probability outside these components. For the simplest 
example we give with an abelian fundamental group, QMC does not equilibrate but still solves the 
optimization problem. More severe effects leading to failure to solve the optimization can occur 
when the fundamental group is a free group on two generators. Other examples where QMC fails 
have a trivial fundamental group, but still use ideas from topology relating group presentations to 
simplicial complexes. We define gadgets to realize these Hamiltonians as the effective low-energy 
dynamics of a transverse field Ising model. We present some analytic results on equilibration times 
which may be of some independent interest in the theory of equilibration of Markov chains. Con- 
versely, we show that a small spectral gap implies slow equilibration at low temperature for some 
initial conditions and for a natural choice of local QMC updates. 



The quantum adiabatic algorithm^ uses a parameter- 
dependent Hamiltonian H s to find the ground state of a 
classical optimization algorithm. The Hamiltonian Hq is 
chosen to have a simple ground state, typically of product 
form, that can be easily prepared. After initializing the 
system in this state, the system is evolved under a time- 
dependent Hamiltonian, H s n\, with s(t) slowly changing 
as a function of time t, until at some final time tf with 
s(tf) — 1, the Hamiltonian describes some classical op- 
timization problem. If the rate of change is sufficiently 
slow, then the system stays close to the ground state 
throughout this procedure, thus solving the optimization 
problem. The original adiabatic quantum algorithm re- 
lied on coherent evolution with a sufficiently large gap 
A between the ground and first excited state along the 
path of H s . The time required for the algorithm scales 
1/A 2 , and depends also upon the norm of terms in the 
Hamiltonian. Later work considered effects of incoher- 
ent evolutioiP. One potential advantage of the adiabatic 
algorithm is that it is a general purpose tool; similarly 
to the classical simulated annealing algorithm, while the 
quantum adiabatic algorithm may not be the best tool 
for any given problem, it can be applied to a wide range 
of problems. 

Unfortunately, little is known definitely about the per- 
formance of the adiabatic quantum algorithm for practi- 
cal problems, despite much work on studying small sys- 
tems using exact diagonalizatiorP and larger systems us- 
ing numerical quantum Monte Carlo (QMC) studies^. 
One remarkable recent study^ is based directly on study- 
ing the Hamiltonian of the D-Wave device^, showing 
evidence for nontrivial collective quantum effects. The 
QMC studies, and this device, all involve Hamiltonians 
with "no sign problem" , as explained below; in particu- 
lar, they are transverse field Ising models. 



While the quantum adiabatic algorithm with arbitrary 
Hamiltonians is known to be as powerful as the circuit 
model for quantum computation-^, it is unclear what ad- 
vantage such Hamiltonians with no sign problem can 
have over classical computation. Perhaps one can always 
simulate such systems using QMC? In particular, since 
the main theoretical problem with QMC is understanding 
the equilibration time, perhaps a bound on the spectral 
gap A implies some bound on the equilibration time, if we 
follow certain protocols for equilibrating the QMC simu- 
lation? In this paper, we show that for the most natural 
equilibration protocol, where one equilibrates the QMC 
at s = and then slowly changes s and tries to equi- 
librate the QMC s changes, this is not true in general. 
We give several counter-examples (we modify the algo- 
rithm and protocol to account for some of the counter- 
examples, but then find other counter-examples to the 
modified algorithm). 

One reason that we might expect this annealing pro- 
tocol to work is that a similar approach does work when 
using a matrix product state algorithm (instead of QMC) 
to study a one-dimensional quantum system with a spec- 
tral gap, in that we exploit the idea of following the ma- 
trix product state along the path to avoid problems with 
getting trapped i n a local minimurrP. That result ex- 
ploits the area law^A Another result along these lines is 
that if we restrict to frustration-free Hamiltonians with- 
out a sign problem, then the problem of simulating adi- 
abatic evolution is in the complexity class BPP^. Note 
also that if we leave the context of adiabatic comput- 
ing, the problem of approximating the ground state en- 
ergy of sign-problem free Hamiltonians is irP^l complexity 
class AM, while the analogous problem for arbitrary local 
Hamiltonans is QMA-complete^. 

For a further illustration of why we might believe that 
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the spectral gap A is related to the equilibration time of 
quantum Monte Carlo, consider a single particle Hamil- 
tonian in a tilted double-well potential: 

H = -J-d 2 + nx 2 + x A + hx, (0.1) 
2m 

with [i negative. Suppose we start at positive h, where 
the ground state has most of its amplitude in the left 
well, and then change the sign of h, trying to equilibrate 
the QMC as h is changed. Problems with equilibration 
may occur for large \fi\, where the barrier between the 
wells is large, because the trajectory of the particle in 
imaginary time can get stuck in the left well, unable to 
tunnel through the barrier. However, if the barrier is in- 
deed high, then at h = 0, the spectral gap becomes small. 
So, based on this simple example one might expect a con- 
nection between spectral gap and equilibration of QMC. 
We emphasize that the connection between spectral gap 
and equilibration only appears here when we follow this 
particular annealing protocol; if we instead start at non- 
negligible h but with the initial condition of the particle 
in the wrong well, then the QMC might be slow to equi- 
librate even though there is no spectral gap. 

However, this example is really based on having the 
space of likely positions of the particle split into two dis- 
connected sets, the left and right wells, with the coordi- 
nate x = being unlikely to occur. That is, one may 
say this is an example of a nontrivial ttq, the zeroth ho- 
motopy group, of the space of likely positions. QMC, 
however, considers a trajectory in imaginary time, and 
so one might expect obstructions based on a nontrivial 
7Ti, the fundamental group. This is, in fact, the basis for 
some of our examples below. 

In a sense, this kind of obstruction is well-known. For 
example, problems with equilibrating different winding 
number sectors when simulating particles on a torus are 
well-studied and various nonlocal update rules have been 
introduced to try to alleviate this problem^. However, 
these nonlocal updates are often introduced in a way that 
is quite specific to the particular Hamiltonian considered 
and we do not know a general way to implement them 
that would deal with the cases considered later, so we do 
not consider nonlocal updates further. 

Further, while problems with equilibrating different 
topological sectors have been considered before, typically 
this has been studied for abelian fundamental groups 
such as the fundamental group of the circle or torus while 
we consider cases where the fundamental group is a free 
group on two generators. The non-abelian nature of this 
group leads to significantly worse effects from the differ- 
ent winding number sectors, as discussed below. 

After giving these counter-examples with a nontrivial 
fundamental group, we show how to realize them using 
gadgets. Then, we give counter-examples with a trivial 
fundamental group. Topology still plays a role in these 
examples, as we relate a certain presentation of the trivial 
group to a simplicial complex. These examples build on 
the examples with nontrivial fundamental group, so that 
section should be read first. 



Finally, we present more analytic results. We show 
that the converse of the conjecture is true: a small spec- 
tral gap implies slow equilibration for certain initial con- 
ditions. We also present some upper bounds on the equi- 
libration time. 

An appendix at the end of the paper, due to M. H. 
Freedman, provides some additional geometric and topo- 
logical context to phenomena discussed in the main text. 

1. REVIEW OF ADIABATIC ALGORITHM, 
SIGN PROBLEM, AND THE ANNEALING 
PROTOCOL 

The adiabatic algorithm considers a parameter depen- 
dent Hamiltonian H s . A typical application of this al- 
gorithm would be a system of N spin-1/2 spins with a 
parameter dependent Hamiltonian H s of the form 

H s = -(l-s)J2S? + sV, (1.1) 

i 

where i — 1, N labels the different sites, Sf is the x- 
component of the spin on site i and V is some operator 
which is diagonal in the z-basis. The operator V will 
typically be a sum of many terms, each depending upon 
a small number of spins. For example, one could have 
V = J2i j JijSfSj f° r some matrix J^. We will assume 
some polynomial bound on the norm of the terms in the 
Hamiltonian. The goal of the algorithm is to find a con- 
figuration of the spins in the z basis which minimizes V. 
Such a configuration could be the solution of some clas- 
sical optimization problem, with the particular problem 
being encoded in the matrix J. Note that the ground 
state of H is a product state with all spins polarized in 
the ^-direction and so can be prepared easily. 

The performance of the algorithm depends upon the 
minimum spectral gap A along the path. As a minor 
technical point, for some optimization problems, we find 
that V has a degenerate ground state. In this case, the 
gap between the ground and first excited state goes to 
zero at the end of the path. This does not pose a problem 
as any of the final states represents a solution to the 
optimization problem. Alternately, suppose that for most 
of the path (namely, for the portion of the path with B > 
l/poly(iV)), the inverse gap is at most polynomial. Then, 
we run the adiabatic algorithm for the portion of the path 
with B > l/poly(iV)), and then terminate. Measuring 
the final state in the z-basis will give an outcome that is, 
with probability 1 — l/poly(iV), a minimum of V. 

A. Sign Problem 

The Hamiltonian ( |1.1[ ) is an example of a Hamiltonian 
with "no sign problem" , enabling the use of a path inte- 
gral QMC algorithm explained below to study properties 
of the ground state. We now give a fairly general ex- 
planation of the sign problem and of a simplified QMC 
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algorithm. Consider a Hamiltonian H and a basis of state 
ip(c) where c is some discrete index. We refer to c as a 
"configuration" . Then, we say that "the Hamiltonian has 
no sign problem" if whenever c ^= d we have 



{il>(c),H^(d))<0 



(1.2) 



Of course, this definition is somewhat imprecise. For any 
Hamiltonian, we can find a basis transformation to diago- 
nalize the Hamiltonian in which case Eq. (1.2) is satisfied 



for that basis. However, for the QMC to be efficient, we 
want to be able to efficiently calculate (i/j(c), Hip(d)) for 
all c, d. To do this, we are often interested in the case 
that the basis i/j( c ) is a prod uct basis; i.e., we consider a 
system of N sites, as in Eq. ( |1.1[ ), with the Hilbert space 
of the whole system being the tensor product of the N 
different Hilbert spaces and the basis ip(c) should be a 
product basis. It should be noted that while Eq. (1.2) is 



sufficient not to have a sign problem, it is not necessary; 
for example by a basis change, various Heisenberg mod- 
els which seems to have a sign problem in one basis can 
be shown not to have a sign problem in another^. 

The simplest version of the path integral Monte Carlo 
works as follows. Consider the partition function Z = 
Tr(exp(— j3H)) for some given j3. We divide the system 
into K "time slices" for some integer K, introducing one 
index Ci per time slice and summing over all indices. We 
then write: 



Tr(exp(-/3#)) 



(1.3) 



K 



{Ci}i=l 

where we have i = 1, ...,K and where the sum is over all 
possible values of c\,...,Ck. We fix Cjc+i — C\; that is, 
Ci is periodic. We refer to a sequence of c\, ...,ck as a 
trajectory, saving the term "path" instead for a path in 
parameter space. We can then approximate the terms in 
the product by something which we can efficiently calcu- 
late 



Part of the description of a quantum Monte Carlo Al- 
gorithm is not just the space of states (in this case, the 
sequence of c%, ck) and the probabilities (given above) 
but also the transition rule. The simplest possible choice 
is a local update rule in which we randomly pick a given 
Ci and then try randomly changing the value of that Ci. 
For a system with an exponentially large number of pos- 
sible choices of Ci (as in ( 1.1 1, where each Cj takes one of 
2 N possible values), we often consider changing only the 
value of one or a small number of spins at a time. 

In fact, such a choice of transition rule is essential to 
defining what we mean by ~K\. If we have a discrete set 
of states ip(c), we can define a graph, with vertices of the 
graph corresponding to possible values of c, and edges 
between vertices c,d if (ip(c), Hip(d)) ^ 0. To define the 
concept of tti, we interpret the graph as a 1-complex, 
and we add some 2-cells to the 1-complex corresponding 
to different local updates that the QMC can implement: 
for example, if we have 3 different vertices, c, d, e, with 
edges connecting all three, and it is possible for a local 
update to change the sequence c\ = c, C2 = d, C3 = e 
into the sequence c\ — c,C2 — e,C3 = e, then we at- 
tach a 2-cell to those three 1-cells. If we can update 
a sequence c\ = c, C2 = d, C3 = e into the sequence 
Ci = c, C2 = d',ca = e then we attach a 2-cell to the 
four 1-cells corresponding to following four edges of the 
graph: (c,d), (d,e), (e,<f), (d',c). 

However, we will not consider this kind of more for- 
mal definition of a complex any further. The reason is, 
we would have to also in some way decide that config- 
urations which appear with negligible amplitude should 
be removed from the graph when computing tt (as in 
Eq. (0.1) or 7Ti. Currently, we have not given a pre- 



cise way of specifying how to remove these configura- 
tions. Because of this imprecision, we will be content 
with heuristically arguing below that certain algorithms 
will be unable to equilibrate in certain cases. 



B. Annealing 



(V>(c i+1 )|e X p(-/3F/i<Q|V>(c i )) (1.4) 
« Sc i+uCi - ^{ci +l )m{c)). 

For large enough K (polynomially large in f3 and in the 
norm of H), the approximation error becomes negligible. 

At this point, we now have expressed the partition sum 
as a sum over positive quantities which can be statisti- 
cally sampled using a Monte Carlo procedure. This al- 
lows one to directly determine observables which are di- 
agonal in the basis such as Z~ 1 (ip(c)\ exp(— j3H)\ip(c)) by 
measuring the probability distribution of the indices Cj. 
Using more sophisticated methods, it is possible to de- 
termine off-diagonal quantities; for example, see Ref. I16I 

The discretization described here is somewhat simpli- 
fied. In practice, more sophisticated methods are often 
used to deal with discretization in a more efficient way^. 



The most fundamental problem with QMC, as with 
any Monte Carlo algorithm, especially when applied to 
optimization, is the problem of equilibration. At s = 0, 
the system equilibrates rapidly, but in general, we ex- 
pect that equilibration for large enough s may be expo- 
nentially slow for some choices of the Hamiltonian for at 
least some choices of the initial state. 

However, there are many possible annealing protocols 
for the Monte Carlo dynamics. In classical Monte Carlo 
simulations, for example, we can follow a simulated an- 
nealing procedure of slowly reducing the temperature. 
Similarly, we can anneal the QMC dynamics using Hamil- 
tonian H = H s by slowly increasing /? or by slowly chang- 
ing s. This last is the case that we analyze in this paper: 
fixing [3 and slowly changing s. The goal is to change 
s sufficiently slowly that the procedure remains close to 
equilibrium throughout. A more sophisticated approach 
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than slowly changing s is to use a parallel tempering 
procedure (equilibrating at several different values of s 
simultaneously, and allowing moves that swap trajecto- 
ries between different values of s); we do not analyze this 
in this paper. See Refs. I4I5I18I for practical implemen- 
tations of some of these approaches. Later we briefly 
consider the case in which f3 is allowed to change during 
the annealing. 

The conjecture, then, is that if we start at s = with 
the Monte Carlo procedure equilibrated and if the param- 
eter s changes only by a small amount e from one step 
to the next, then if e is polynomially small (in N, /3, and 
the spectral gap A), then the procedure remains close to 
equilibrium for all s, equilibrating along the path in a 
polynomial time. It is important to emphasize that we 
do not conjecture that if there is a spectral gap, then the 
procedure at a fixed s equilibrates in polynomial time 
for any starting trajectory, which clearly would not be 
true because at s = 1 the procedure may not equilibrate 
rapidly, but only that the equilibration happens when we 
follow a particular path. 

C. Other Boundary Conditions 

Above, we have imposed periodic boundary conditions 
on Cj. Other boundary conditions are possible. For ex- 
ample, one can have open boundary conditions, where 
we evaluate 

(^(eM-mi^i) (1-5) 

K-l 

= E II^(c l+ i)|exp(-/3iJ/^)^(c,)), 

where ipi is the state J2 C IVK ))- O ne can a ^ so allow pa- 
rameters to change with imaginary time; instead of tak- 
ing H = H s for all times, we can choose weights 

J2 HiTpici+Jlexpi-PH^/KMia)), (1.6) 

{c,} i=l 

where s(i) depends upon i. These different boundary 
conditions will be considered below. 



2. COUNTER-EXAMPLES 

We now describe several systems which serve as 
counter-examples to the conjecture above. We will de- 
scribe these systems not in terms of Ising spin degrees of 
freedom, but rather in terms of a particle moving in some 
potential. That is, they will all be examples of single- 
particle quantum mechanics. The annealing protocol for 
each system consists not of changing a transverse field 
but of changing some other parameters, and we consider 
paths in parameter space that are not just linear interpo- 
lations between two different Hamiltonians but are more 



general paths. In the next section, we will describe some 
"gadgets" to define Ising spin systems whose effective 
dynamics mimics that of the systems considered in this 
section, though again we consider more general paths in 
parameter space, allowing arbitrary tuning of Ising cou- 
plings and transverse fields. 

We will describe the first system using a continuous 
space of states for the particle and use a discrete space 
of states for later examples. In the case of a continu- 
ous space of states, we will briefly explain how to dis- 
cretize the space of states. In the first three examples, 
the number of such discrete states will be proportional to 
a quantity that we write as M. We will refer to coupling 
constants being "polynomially large" if they are bounded 
by a polynomial in M, and we will consider a gap to be 
at most polynomially small if it is at least an inverse 
polynomial in M. When we define the gadgets later, the 
needed value of N (the number of Ising spins) will be a 
polynomial in M, so any quantities that are polynomials 
in M will then be polynomials in N also. Similarly, if 
a quantity is exponentially small in a polynomial in M 
(for example, tunneling between different winding num- 
ber sectors in the first example), it will be exponentially 
small in a power of N, albeit possibly a power less than 
1; in general, we call such quantities exponentially small 
and do not worry about the power in the exponent. In 
the fourth example, we will have exponentially many low 
energy states, and constructing the gadgets will be more 
complicated. 

The reason for giving several counter-examples is that 
we will modify the algorithm in attempts to deal with 
some of the early counter-examples, and then we will pro- 
vide further counter-examples which show that even the 
modified algorithm does not work in general. Also, the 
later counter-examples show more severe effects and over- 
come some unsatisfactory features of the early counter- 
examples. 

A. First Example: Circle 

The first system (we will see later why we refer to this 
as a circle) is single particle quantum mechanics of a par- 
ticle moving in two-dimensions with the Hamiltonian 

H=-±d 2 + V(x,y), (2.1) 
2m 

where the derivative d 2 = d 2 + d 2 . We choose the poten- 
tial 

V(x, y) = fi(x 2 + V 2 ) + 9(x 2 + V 2 ) 2 - hx. (2.2) 

We will fix g = 1 throughout but we will change the cou- 
pling constants fj,, m, h along the annealing path. Note 
that at h = 0, for \i < 0, the minimum of the potential 
is such that x 2 + y 2 = —fi/2g. 

The point of this example is not to show that the QMC 
procedure is unable to find the ground state, but rather to 
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show that the QMC procedure must take an exponential 
time to equilibrate despite having only a polynomially 
small gap, thus contradicting the conjecture above. We 
will develop other examples later where QMC does not 
find the ground state. 

The continuous form is given for illustrative purposes 
only and we will work with a discretization instead. Let 
us define quantities R, a, with a << 1 and R » 1. 
The quantity a will define a discretization scale at short 
distances and R will define a maximum distance. We will 
let x, y become discrete, each being equal to an integer 
multiple of a, such that < R. The discretized 

Hamiltonian is 



H = - 



(2.3) 



x.y 



Note that M ~ (R/a) 2 . 

We follow the following annealing path. We begin at 
fx = h = and m = 1. Then, we make fx become nega- 
tive, keeping h = 0, until fx becomes equal to — R 2 . For 
negative fx, the minimum of the potential is at 



V 



2' 



(2.4) 



so at [i = R 2 /2 the minimum of the potential is at 
x 2 + y 2 = R 2 /2. By choosing a to be 1/poly (R), we 
can make the effects of discretization negligible. Once 
fx becomes sufficiently negative, the particle is very nar- 
rowly confined near the minimum of the potential: trans- 
verse motion has a very high energy cost, with the sec- 
ond derivative of V in the perpendicular direction being 
of order fx 2 . The effective dynamics is that of a particle 
moving in a circle (hence the name of this example) , with 
Hamiltonian 



-1 



2mri 



-d 2 + hRcos(9), 



(2.5) 



where the angular variable 9 is periodic with period 2-k 
and where we have included the term h in the poten- 
tial because the next step involves increasing h. The 
low-lying excited states at h = correspond to different 
angular momentum modes, with the lowest states having 
zero angular momentum and the first excited state being 
doubly degenerate, having angular momentum ±1. The 
energy splitting is of order l/r^ in , and hence the gap 
remains at most polynomially small up to this point. 

We next increase h from to 1. The gap is at most 
polynomially small in R throughout this path. At h = 1, 
we can approximate the Hamiltonian by expanding near 
61 = 0: 



R. 



const. 



(2.6) 



2mR 2 2 

which is a harmonic oscillator Hamiltonian. The wave- 
function decays exponentially for \9\ > 1/i? 3 / 4 , and so 
the particle is localized near 9 = 0. 



Finally, we increase m to infinity. Note that increas- 
ing m corresponds to decreasing the corresponding term 
in the Hamiltonian, and is analogous to turning off the 
transverse field. Note also that because of the discretiza- 
tion, the gap remains at most polynomially small as m 
is decreased for appropriate choices of R, as follows. At 
772 = oo , the eigenfunctions are localized on a single state 
\x,y), and so the eigenvalues are just the different values 
of the potential V(x, y) at appropriate discrete values of 
x,y. One could pick R badly so that V(x,y) has a pair 
of degenerate minima in the discrete case, but it is easy 
to avoid this. At the end of this process, the particle has 
found its ground state. 

Now, let us analyze what happens with the QMC an- 
nealing procedure. The worldline of the particle is some 
closed path in imaginary time. Once fx is sufficiently 
negative, the distribution of trajectories has very small 
probability to include any point with radius x 2 +y 2 which 
differs much from r^^/2 and we expect that it takes an 
exponential time to transition from one winding number 
sector to another. 

We can estimate the equilibrium winding number of 
a trajectory for Hamiltonian (2.5) at h = and arbi- 
trary r^ in as follows. The contribution to the partition 
function of the trajectories with winding number n is 
proportional to the Green's function G(0, 2irn; /3), where 
this denotes the Green's function for a particle moving 
on the line (i.e., the universal cover of the circle) to move 
from to 27m in imaginary time /3. This contribution is 
proportional to exp(— mr^ in (27r7i) 2 /2/3)), which decays 
exponentially for 77 g2 (3/m/r m i n . Hence, a typical tra- 
jectory will have a winding number 77 ~ 

Note that this equilibrium value depends upon r min , 
so the system must necessarily fall out of equilibrium, as- 
suming that fx changes on a time scale faster than the ex- 
ponential time required to change between winding num- 
ber sectors. This already contradicts the conjecture that 
QMC will equilibrate. A similar effect would happen if 
we considered an annealing protocol, such that we equili- 
brated the system at fixed, large, negative fx (taking the 
exponential time necessary to equilibrate between wind- 
ing number sectors) and then increased m. 

Now we return to the annealing protocol discussed 
above, where h increases. In this case, if the sy stem 
is stuck with x 2 + y 2 



R 2 /2 so that Hamiltonian (2.5) 



applies, what we find is that the trajectory spends most 
of its time near 9 = 0, and then spends the rest of the 
time winding around. Suppose, for example, the system 
is stuck in a sector with winding number We can 

calculate an instanton trajectory (i.e., find a minimum 
action trajectory with the given winding number). The 
minimum action solution can be described by a parti- 
cle which spends a long time near 9 — at a slow speed, 
then accelerates and rapidly winds around the circle, then 
again spends a long time near 9 = 0, and again rapidly 
winds around the circle, and so on, until it winds a total 
of 77 times. We can give a quick estimate of the time it 
spends winding as follows. Suppose out of a total imagi- 
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nary time /?, the system spends time j3 — r close to 6 = 
and spends time t << f3 rapidly winding n times around 
the circle. We can estimate the optimum time r by esti- 
mating the action for the optimal trajectory with given 
t and minimizing over r. The action to wind n times in 
time r is of order mR 2 n 2 /r + hrR where the first term 
comes from the kinetic energy and the second term comes 
fro m the p otential energy. This is minimzed at r of order 
ny/Rm/h. 

If we fix R and take (3 large, since n <~ y/fiwe have 
t <~ v 7 ? so that the trajectory spends most of its time 
near 9 = 0. Thus, in some sense the QMC procedure 
does succeed in finding the minimum as the trajectory 
spends most of the time in the correct place, assuming 
that (3 is sufficiently large. 

It is interesting to analyze what happens as to is in- 
creased to infinity, starting at h = 1. The time r ~ 
fiyf Rm/h estimated above increases, eventually becom- 
ing of order j3. However, for sufficiently large to, the 
trajectories stop being localized near x 2 + y 2 = R 2 /2 and 
instead the trajectory gets localized at smaller x 2 + y 2 . 
We can understand this as a balance of two terms (we will 
consider h = for simplicity): if the trajectory is local- 
ized at a given distance r, so that x 2 + y 2 w r, the action 
to wind n times in time (3 is of order mr 2 n 2 /f3 + V(r), 
and for large to this is minimized at small r, so at large 
enough to the trajectories start to have non-negligible 
probability to have x = y = 0. Thus, at large to, the 
system is able to change its winding number to zero in a 
time which is not exponentially long because the trajec- 
tories move to smaller r. 

Eventually, at very large to, the trajectory becomes 
close to constant in imaginary time (each Cj is close to 
all other Cj). At this point, the QMC dynamics becomes 
similar to a classical Monte Carlo dynamics as the trajec- 
tory is almost determined by its value at a given time slice 
and so we can just study a classical Monte Carlo proce- 
dure with weight exp(— /3V(x, y)). This is unsurprising, 
as at large to, the dynamics becomes more classical. At 
large (3, this classical Monte Carlo procedure is similar to 
a greedy algorithm, as with high probability the particle 
only moves to lower potential states. 

The particular potential we have chosen has the prop- 
erty that it has only one local minimum. As a result, the 
classical Monte Carlo dynamics will not get trapped and 
eventually the system will equilibrate at large enough 
to, being stuck just at the minimum. We can modify 
the example by changing the potential near x = y = 0, 
adding an additional minimum there, to trap the large 
to dynamics to construct an example which prevents this 
equilibration. 

To summarize: this example uses winding number as 
a topological invariant of trajectories to construct an 
annealing protocol for which the QMC does not relax 
rapidly. However, for various reasons, this example is 
not completely satisfactory as a counter-example to the 
idea that QMC will succeed in finding the minimum when 
the annealing algorithm does. One such reason is that if 



P is sufficiently large, the QMC does produce trajectories 
which spend most of their time near the desired minimum 
at the point in the annealing protocol when to = h = 1. 
A second reason is that the QMC algorithm has some 
probability of being in the trivial sector with winding 
number n = 0, where it can more readily equilibrate at 
intermediate values of to (i.e., small enough to that the 
winding number sector is still fixed but large enough that 
the equilibrium distribution is dominated by the sector 
n = 0) and this probability of being in the trivial sector 
is only polynomially small. 

One interesting attempt to modify the QMC procedure 
to help equilibration, or at least to ameliorate the effects 
of being stuck in a sector with non-zero winding number, 
is to change (3 during the procedure. Let us again return 
to analyzing the protocol where h is kept at 0, but to is 
increased. In this case, we could allow {3 to increase also, 
and if /3/m are in the right ratio, the system will remain 
in equilibrium at given (3. We will discuss ideas like this 
again in later examples. 



B. Second Example: Bouquet of Circles, "Too 
Long a Word" 

The first example was based on a case where the parti- 
cle was confined (up to exponentially small corrections) 
to a circle. The fundamental group of the circle is Z 
and is abelian. In this case, the equilibrium state had 
a winding number proportional to the square-root of j3. 
In the next example, we consider a case where the fun- 
damental group is a non-abelian group. We consider a 
system where the particle is confined to a space which 
is a bouquet of circles. A bouquet of circles consists of 
several circles glued together at one point. The funda- 
mental group of a bouquet of n circles is the free group 
on n generators. For simplicity, we consider a bouquet 
of 2 circles. 

This example makes the effects in the previous section 
more severe, especially in the large f3 case. This example 
also introduces ideas used in later examples. 

The different topological sectors are described by 
words in the free group. For example, if we have two 
generators, called a, 6, then a possible word is aba~ x b. 
Words such as abaa~ 1 a~ 1 bb can be reduced by cancelling 
the successive appearance of a generator (a) and its in- 
verse (a -1 ), and in fact abaa~ 1 a~ 1 b = aba~ 1 b, and the 
two words describe the same topological sector. Further, 
we can cyclically reduce a word (cancel a generator at 
the start of the word against its inverse at the end) and 
& _1 a&a _1 i>& describes the same sector as aba~ 1 b. 

For a given topological sector, we can ask for the length 
of the short possible word that describes that sector. This 
is the length of the cyclically reduced word. Thus, for 
aba~ 1 b, the length is 4. 

A possible system with such a fundamental group has 
2M — 1 basis states. There are two sequences of states, 
labelled \i, a) and \i, b), where i is an integer in 1, M — 
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FIG. 2.1: Illustration of a graph corresponding to a bouquet 
of two circles for M = 8. 



1. Additionally there is one other state labelled |0). We 
will construct a Hamiltonian whose effective low energy 
dynamics is given by H bouquet , defined by 

M-2 

Hbouquet = - ^ (\i + l,x){i,x\+h.c, 

i=l x£{a,b} 
M-l 

+2^ £ \i,x)(i,x\ (2.7) 

»=1 x£{a,b} 
x£{a,b} 

- £ (\M- l,x){0\ +h.c.) 

x£{a.b} 

+4|0)(0|. 

The diagonal terms in the Hamiltonian are chosen so that 
the ground state of H is an equal amplitude superposition 
of all states. 

See Fig. |2.1| The Hamiltonian is equal to the graph 
Laplacian on the graph shown, where we have shown the 
case M = 8. The graph Laplacian has an off-diagonal 
element equal to —1 between any two vertices connected 
by an edge, and has diagonal elements equal to the degree 
of a given vertex (so the vertex at the middle of the figure 
has degree 4 and all the others have degree 2). 

The Hamiltonian Hbouquet will be an effective Hamilto- 
nian for some other Hamiltonian with a larger number of 
states, similar to howi? c j rc was an effective Hamiltonian 
previously. We add additional states to the system and 
follow some annealing protocol so that at some point in 
the protocol Hbouquet becomes a description of the effec- 
tive dynamics and so that the typical trajectory created 
by the QMC algorithm is in a topological sector whose 
shortest word length is proportional to (3. We now sketch 
one way to do this, but the reader can certainly imagine 
many possible ways. We have drawn the bouquet of cir- 
cles in the plane. We can imagine that the particle moves 
through the plane similarly to the previous case, and that 
it is some potential V(x, y) that confines it to the two cir- 
cles (and that also produces the appropriate diagonal and 



off-diagonal terms in Hbouquet)) ■ We can imagine that we 
follow an annealing protocol so that initially the particle 
is able to move throughout some region of the plane, and 
that we change some parameter so that eventually the 
particle gets confined to the given bouquet. 

In order to exponentially suppress the motion away 
from the two circles, we might want to take M polyno- 
mially large. Also, later we like to take M polynomially 
large to localize certain states as we turn on a poten- 
tial term h in Eq. (2.8) below. Otherwise, the particular 



value of M is not that important, though we do need 
M > 3 to define the direction of winding around a circle. 

Having quenched to this Hamiltonian Hbouquet , we now 
imagine an annealing protocol for a Hamiltonian of the 
form 



H 



1 



-H 



bouquet 



h\0)(0\. 



(2. 



We pick h > and the term h is added to produce a 
minimum in the potential. The particular choice of the 
minimum being state |0) as opposed to some other state 
is unimportant. Changing m plays a similar role to be- 
fore, and at large m the equilibrium trajectory is close 
to constant. The annealing protocol from this point is: 
start at m = 1 and h = 0. Then, increase h to 1. Finally, 
increase m to oo. 

As in the previous case, if we change m but keep h 
fixed at zero, the system must fall out of equilibrium if 
it is unable to transition between different topological 
sectors. To analyze this, in equilibrium, at h = 0, the 
length of the short possible reduced word for a typical 
winding number sector is as claimed above, 



const. ■ (3/{mM 2 ) 



(2.9) 



This follows from the fact that a trajectory in imaginary 
time describes a closed random walk on this bouquet of 
circles. It takes time of order mM 2 for the random walk 
to go once around a circle, and so the trajectory corre- 
sponds to a random word of length /3/M 2 . However, for 
a random word of given length on the free group with two 
generators, the length of the corresponding cyclically re- 
duced word is typically only a constant fraction smaller 
than the given random word. So, as m changes, this word 
length changes. 

We now analyze what happens as h increases at fixed 
m = 1. In this case if we take h large, we can make the 
ground state localized near |0). Indeed, if we take h of 
order unity, then the ground state is exponentially local- 
ized near |0), and by taking M large we can exponentially 
suppress states of distance ~ M/2 from |0). However, 
suppose we have a trajectory stuck in a topological sec- 
tor with word length of order /?. Then, we can perform 
a similar instanton analysis as before. We do the instan- 
ton analysis by going to a continuum limit and finding a 
minimum action trajectory. We study this trajectory on 
the universal cover of the bouquet of circles (this cover 
is a tree graph), where the the trajectory travels a dis- 
tance of order the word length (i.e., of order j3) in time 
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j3. So, the trajectory has an action proportional to j3. 
At the point that m = h = 1 in the annealing protocol, 
the fraction of time that this trajectory spends near |0) 
is /? independent, differing from what wc found when the 
target space was just a single circle where in the large 
j3 limit the trajectory spends most of its time near |0). 
This is a result of the word length being of order j3 now 
rather than y 7 /?- as previously 

To summarize, again we find problems equilibrating, 
and again it occurs because the system is typically stuck 
in a nontrivial topological sector. We have referred to 
this example as "too long a word" for this reason. 

C. Third Example: Bouquet of Circles, "Too Short 
a Word" 

Since the previous examples were both based on a situ- 
ation in which the system is stuck in a nontrivial topolog- 
ical sector, but the minimum action sector is the trivial 
topological sector, one might imagine trying to modify 
the QMC algorithm to cause the dynamics to be stuck 
in the trivial sector, or at least stuck in a sector with 
a shorter word length than typical. For example, we 
could follow an annealing protocol in which we change 
both (3 and s. One possibility would be to increase (3 
during the annealing protocol. We could either increase 
/3 while keeping the number of time slices constant (in 
which case the change in f3 leads to a change in the sta- 
tistical weights for a trajectory), or we could also change 
the number of time slices. For example, one possible way 
to increase the number of time slices by one is to replace a 
trajectory C\,...,Ck by a trajectory C\, ...,Ck,Ck+i, set- 
ting ck+i = Ck- The goal of increasing f) in this way 
would be to make the system be closer to the trivial sec- 
tor for the given f3; that is, consider the first example 
of a circle. The winding number is proportional to f3. 
If we equilibrate the winding number at a given /? and 
then double /3, the winding number is now smaller than 
expected for the given f). 

An alternate approach would be to combine this in- 
creasing in f3 with a dependence of s upon the time slice; 
we do not discuss this further as this example will be 
hard for such a case too. 

We now consider an example for which such an ap- 
proach would not work. Consider a system with the same 
states as above for the bouquet of circles example, and 
one additional state called |r). Let the Hamiltonian be 

H = mH bouquet -h\0)(0\ (2.10) 
-t\0)(r\ + h.c. 
+E\r)(r\. 

That is, we have added a tunneling term t connecting 
the state |0) to the state \r) and also added a potential 
term E for state \r). Note that these states (the bouquet 
and added state \r)) are the only states we consider; this 
differs from the previous example where we considered a 



system with a larger number of basis states, and changed 
some parameter to confine the particles motion to the 
bouquet, quenching into a topologically nontrivial sector. 

Note that the ground state of Hb ouquet has energy 0, 
and the first excited state of Hi, OU g Uet has at least energy 
c/M 2 , for some positive constant c. It will be impor- 
tant in what follows to consider also the spectrum of the 
Laplacian on the universal cover of the bouquet of cir- 
cles. This cover is a tree. This tree can be constructed 
as follows: start with a tree T such that all nodes have 
degree 4; that is, the root has 4 daughters and all other 
nodes have 3 daughters. Take this tree and insert M — 1 
additional vertices in the middle of each edge, to con- 
struct a new tree T"; that is, replace an edge between 
two nodes v,w by an edge from v to v\ then from v\ to 
i>2 and so on, up until «m-2 to vm-i, and then an edge 
from vm-i to w. The spectrum of the Laplacian on T 
is in [4 — 2\/3, oo). The exact value 4 — 2^/3 is not that 
important; what is important is that this value is greater 
than 0. Similarly, the spectrum of the Laplacian on T' is 
at least [c'/M 2 , oo) for some positive constant c' . 

We follow this annealing protocol: start at h = t = 0, 
and with E being large and negative, so that the ground 
state at the start is \r). Increase t from to c/2M 2 . 
Then, increase E until E is min(c, c')/2M 2 > 0. Then, 
decrease t to 0. Then, increase ft- to 1 and finally increase 
to to oo; this last stage of the annealing protocol is the 
same as in the previous example. 

Once E reaches its maximum value, the ground state 
of the quantum Hamiltonian is some superposition of |r) 
and some state on the bouquet. Since t is smaller than 
the energy of the first excited state of the bouquet, the 
state on the bouquet has most of its amplitude on the 
ground state of the bouquet. This process can be under- 
stood as an avoided crossing: the energy of \r) crosses 
zero (the energy of the ground state on the bouquet), 
but because of the non-zero t the crossing is an avoided 
crossing. 

As t is decreased, the amplitude of the ground state on 
\r) decreases, until that amplitude is zero once t reaches 
zero. Finally as h is increased, then the amplitude of 
the ground state on |0) increases, until at the end of the 
protocol the ground state is exactly |0), as in the previous 
example. Note that throughout the spectral gap is only 
polynomially small in M. 

Now we consider what happens for the QMC protocol. 
We number our configurations in the natural way: we 
write i()(r) — \r) and write i[)(k,x) = \k,x). Initially, 
we have c,- L = r for all i. However, once t becomes non- 
zero, the trajectory starts to spend time on the bouquet. 
Suppose at some pair of time slices we have Cj = cj = 
r, but for all times fee {i + l,i + 2, — 1} we have Ck ^ 
r. Then, the sequence of configurations Cj + i, Cj_i is 
some sequence that starts and ends at |0) and forms a 
topologically trivial path. To understand this, let us refer 
to a sequence of time slices such as i + 1 , . . . , j — 1 such that 
the particle is on the bouquet and such that Cj — cj = r 
as an "interval" . The length of an interval can increase or 
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decrease under the dynamics but the topologically sector 
cannot change. Initially, there are no intervals. When a 
new interval is created, it is created as a single time slice, 
containing only one configuration, 0. The trajectory on 
this interval is topologically trivial. 

As a result of this constraint on the topology of the 
trajectory during the intervals, the QMC dynamics is un- 
able to distinguish between the given Hamiltonian, and a 
Hamiltonian where we have coupled the state |r) to the 
universal cover of the bouquet. However, the energy E is 
at all times less than the bottom of the spectrum of the 
Laplacian on the cover of the bouquet, since we have cho- 
sen E < c' /M . So there is no avoided crossing simply 
because even at t = there would be no crossing. When 
t is reduced back to 0, the ground state of the Hamilto- 
nian coupling |r) to the universal cover of the bouquet 
is the state \r). Hence, the QMC procedure produces 
the states \r) and does not find the correct ground state 
of the Hamiltonian at the end of the annealing protocol. 
In fact, if the QMC is perfectly equilibrated within the 
trivial topological sector, then we find that the QMC has 
zero probability of finding the ground state. 

This situation is then much worse than the previous 
examples. The failure of QMC can be understood in a 
different fashion. Suppose we have t — 0. Then, the par- 
tition function is a sum of two different quantities, one 
being the partition function of the bouquet and one be- 
ing the contribution exp(— (3E) from the state \r). For 
large f3 >> M 2 , the partition function of the bouquet 
approaches 1, as the bouquet has a unique ground state 
with energy 0. However, this partition function 1 is a 
sum of contributions from exponentially many different 
topological sectors. Any given topological sector has a 
contribution which is exponentially suppressed in /3/M 2 . 
Hence, an algorithm that is unable to equilibrate be- 
tween sectors greatly underestimates the contribution of 
the bouquet to the partition function. 



D. Fourth Example: Bouquet of Circles, Open 
Path in Imaginary Time 

All of our examples so far have been based on a non- 
trivial fundamental group. A natural question is whether 
we can resolve these problems with QMC by using open 
boundary conditions instead. To motivate this, if we con- 
sider classifying closed paths in some space, then the fun- 
damental group 7Ti enters, but if we classify open paths 
in some space (i.e., continuous functions from an interval 
[0, 1] to some space, with no requirement that and 1 be 
mapped to the same point), then the classification of such 
open paths is the same as ttq'. if the space is path con- 
nected, then any two such open paths can be deformed 
into each other. 

In this example, we show that such a QMC algorithm 
still does not necessarily work. The example builds off 
our previous example. We still have a bouquet of cir- 
cles, but now in addition to adding the state |r) as in 



the previous example, we also add another set of Nq dif- 
ferent states, labelled |1,G), \Ng,G), and define some 
expander graph whose vertices correspond to the states 
\i, G). We let the Hamiltonian be: 



H = 



^-^bouquet 

h\0)(0\ 
-t\Q){r\ +h.c. 
+E\r) (r\ 



(2.11) 



-L, 



expander 



VP, 



expander 



-t'\r){l,G\ + h.c. 

where L expan( i er is the graph Laplacian on the expander, 
and Pexpander is a diagonal matrix equal to 1 for states 
on the expander and otherwise. 

First we analyze the properties of the part of the 
Hamiltonian that acts on \r) and on the expander: 



E\r)(r\ 

H~ ^ expander * P expander 

+t'\r)(l,G\+h.c. 



(2.12) 



The Hamiltonian L expander + VP expa nder has ground 
state of energy V, and then a gap to the rest of the 
spectrum. For E < V and t = 0, the ground state of the 
Hamiltonian (2.12) is |r) with energy E. For E < V and 



small t, the ground state is a superposition of some state 
on \r) and some state on the expander. This state on the 
expander has its largest amplitude on |1,G), with the 
next highest amplitude on the first neighbors of 1 1 , G) , 
and the amplitude decreasing away as we consider fur- 
ther neighbors from |1,G). Note that for E < V and 
t << \E — V\, the ground state has almost all of its 
probability on |r), and has only a probability of order 
|i 2 |/|i? — V\ on the expander. However, we can choose 
E close to V so that the following happens: almost all 
of the amplitude of the state is on the expander. That 
is, if we pick a site with probability proportional to the 
amplitude of the ground state wavefunction, then the re- 
sult is very likely to be on the expander. By taking Nq 
large, we can make the probability of the ground state 
wavefunction strongly concentrated on |r) but the am- 
plitude strongly concentrated on the expander. We write 
Eo(E, V, t') to denote the ground state energy of Hamil- 
tonian (12. 121). 



We begin the annealing protocol with t = t' = 0, and 
we choose E to be negative with \E\ » 1 so that the 
initial state is highly concentrated (in both probability 
and amplitude) on \r). We then make t' slightly non- 
zero, and adjust V so that the above regime holds, with 
the probability concentrated on |r) and the amplitude 
concentrated on the expander. From this point on in the 
annealing protocol, we maintain the same difference E — 
V, adjusting V to keep this difference constant whenever 
E is adjusted. We also keep the same If. We then follow a 
very similar annealing protocol to the previous example: 
increase t from to c/2M 2 . Then, increase E and V 
until E (E, V, t') is min(c, d) /2M 2 , while keeping E-V 
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constant. Then, decrease t to 0. Then, increase h to 1 
and finally increase m to oo. 

We can choose the difference E — V so that even when 
we increase t to c/M 2 and E (E, V, t') to min(c, c')/2M 2 
and t, the ground state wavefunction has most of its am- 
plitude on the expander. With open boundary condi- 
tions, using Eq. (1.5) for statistical weights, the vari- 
ables ci and ck are correlated. However, in the limit of 
P » A, in equilibrium the joint probability distribution 
approximately factorizes: 



P(ci,c K ) ~ i>a(ci)^o(c K ) 



(2.13) 



where ipo(c) is the amplitude of the ground state wave- 
function, normalized so that 5Z c V'o(c) = 1. That is, the 
probability distribution of c\ and ck are governed by the 
amplitudes of the ground state, and are very likely to be 
on the expander graph. 

If c\ and ck stay on the expander graph throughout 
the QMC simulation, then the topological sector cannot 
change, and we find the same effect as in the previous 
example that the QMC algorithm will be very unlikely 
to find the correct ground state. So, we must ask for 
the probability that c\ or ck does leave the expander 
graph. By taking Nq exponentially large, we can make 
this probability exponentially small. 

This example shows that even open boundary condi- 
tions need not solve the problem, because we can define 
a Hamiltonian so that c\ , ck are "pinned points" . That 
is, they are fixed to be on the expander graph, prevent- 
ing a change in topological sector. Unlike the previous 
examples, we need to use an exponentially large number 
of states, and so it will take a little more care to define 
gadgets for this example in the next section. 



3. GADGETS: FROM THE TRANVERSE FIELD 
ISING MODEL TO MORE GENERAL 
HAMILTONIANS 

We now describe how to construct transverse field Ising 
model Hamiltonians whose effective low energy dynamics 
realizes the four examples considered above. We first 
consider the first three examples. For these examples, we 
need to construct an effective Hamiltonian with a number 
of states ~ M that scales as some polynomial in N. 

The transverse field Ising systems that we consider will 
not necessarily be planar, and we will allow arbitrary 
dependence of the transverse field and Ising couplings 
along the annealing protocol. We leave it as an open 
question whether one can construct gadgets using planar 
Hamiltonians where only a single parameter, the strength 
of the transverse field, is tuned. 

Consider first a Hamiltonian 



Hglobal — JAF ^ S* Sj + h ^ Sf , 



(3.1) 



the energy is J AF {2N^ - iV) 2 + h(2N t - N). By tuning 
Jafj h, we can arrange for this to have a minimum at 
any desired value of Nf, and with a gap of order unity to 
any states with a different value of Nf. 

However, if ^ 0, N, there are many different states 
with the given value of Nf. We will construct an ef- 
fective Hamiltonian in this space of states that realizes 
the desired examples previously. We first describe how 
to construct a Hamiltonian whose effective low energy 
dynamics realizes a hopping Hamiltonian on a circle: 



H = -t^2\a){a + l\ +h. 



(3.2) 



where a is periodic with period M, and where ... represent 
terms t& ^ a \a)(a + k\ + h.c. for k > 1, with tk decaying 
rapidly in k. Set N = M . Label different sites by i, with 
i being periodic with period N. Pick an integer R. Take 
the Hamiltonian 



H = H, 



global 

+ J' S?S?-BJ2S?- (3-3) 

|t-j|<H-l i 



Tune Jaf, h so that the ground state of H g i b a i has N-f — 
R. First we consider the case B = 0. For J' < 0, we have 
a short range ferromagnetic interaction. For \ J'\ « 1, 
we find that the low energy states consist of states with 
JV-f = 1 and with all the up spins next to each other. 
That is, a state of the form H ... •••• til ■■■ I, where 
there is exactly one sequence of up spins with length R. 
There are N — M such different states. 

Taking B small compared to J', we can treat B in 
perturbation theory, and at second order, the effect is to 
allow the sequence of up spins to move either one to the 
right or one to the left. That is, we flip a spin on one side 
of the sequence from up to down, shortening the sequence 
on one side, while flipping a down spin just past the end 
of the sequence on the other side to up, lengthening the 
sequence on that side. 

Now let us explain why we introduce the parameter 
R. At R = 1, at second order in perturbation theory 
the sequence of up spins can move anywhere: there is no 
distinction between different sides of the sequence and 
two spin flips can connect any two states with exactly one 
up spin. For R = 2, the second order perturbation theory 
result gives us the desired effect, but at fourth order in 
perturbation theory, we can move the sequence anywhere. 
For R = 3, the sixth order perturbation theory allows us 
the sequence to move anywhere and there is a term at 
fourth order in perturbation theory that contributes to 
<2, moving the sequence by two. However, we can take 
R to be a polynomial in N, with a power less than 1, 
and take B polynomially small, and then decays as 
an inverse polynomial of N raised to the fc-th power for 
k < R and is negligible for k > R. 

This lets us realize a Hamiltonian of form (3.2). We 
can add additional magnetic fields by a term hiSf , 
allowing us to realize a Hamiltonian 



where the sum is over all i,j. In an eigenstate with a 
total of iV-f of the spins up, and N — N^- spins down, 



H = -t | a) (a + 1 1 + h.c. + ^ V (a)\a) <a| 



(3.4) 
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where V(a) = X^=I _1 hii giving a linear map from 
hi to V(a), and where again the ... represent terms 
£fcX) a l a )( a + ^1 + h.c. for k > 1. This linear map is 
not invertible, so not all V(a) are possible. However, we 
are able in this way to approximate a slowly varying po- 
tential V(a). This allows us to approximate continuum 
equations such as Eq. (2.1 1 by some discrete approxima- 
tion. It is a slightly different discrete approximation than 
before, as we have hopping tk beyond the first neighbor. 
However, it still approximates a continuum equation. 

Using these continuum equations as a building block, 
we can obtain any of the first three examples. Note that 
we have given gadgets to realize a discrete approximation 
of a one-dimensional continuum equation, while these ex- 
amples require a two-dimensional continuum equation. 
This is a simple modification though. 

To realize the fourth example, we need to overcome 
the fact that Nq should be exponentially large. We use 
the above gadgets to construct a Hamiltonian whose low 
energy dynamics has the states on the bouquet of circles, 
the state |r), and the state |1,G) on the expander graph. 
To realize additional states on the expander graph, we 
add an additional N' number of Ising spins. We add a 
magnetic field to these spins which make them prefer to 
be down, but we add an additional ferromagnetic inter- 
action between each of these spins and the spins used to 
construct the state |1, G). This ferromagnetic interaction 
is chosen so that if the particle is not in the state |1, G), 
then these additional N' spins will prefer to be down, but 
otherwise they have equal energy to be up or down. We 
identify the 2 N states where the particle is in |1, G) and 
these additional N' spins arc in arbitrary states with the 
states |1,G),..., \N G ,G,), where iV G = 2 N ' . 



4. FURTHER OBSTRUCTIONS WITH A 
TRIVIAL FUNDAMENTAL GROUP 

We have noted two possible obstructions to equilibra- 
tion, one based on a nontrivial ttq and one based on a 
nontrivial tt±. A natural question is whether these are 
the only obstructions. This question is not completely 
well-defined, since it is not completely clear in general for 
which space we must compute ttq and 7Ti; we have only 
described it as the space in which the wavefunction has 
non-negligible amplitude in some imprecise way. How- 
ever, in this section we explore this question and identify 
other obstructions. To make it simpler to define the space 
in this section, we will imagine that the Hamiltonian is 
simply the Laplacian on some space. Then, the ground 
state wave function has the same amplitude everywhere. 
This space could be a continuous or discrete space. We 
will construct examples where this space is simply con- 
nected and the Laplacian has only a polynomially small 
spectral gap (polynomially small in the volume of the 
given space) so that particles diffuse in a polynomial time 
but for which diffusion of paths is exponentially slow. 

We give three different examples. The first example 



shows slow equilibration starting from certain initial tra- 
jectories, but unfortunately these conditions are unlikely 
to occur in the QMC annealing since they have very 
small statistical weight. The second example shows slow 
equilibration for initial trajectories with large statisti- 
cal weight. This slow equilibration is due to two effects. 
Although the space of trajectories is connected, so that 
there is a sequence of trajectories connecting any given 
pair, this sequence is very long and this sequence requires 
going through trajectories with much smaller statistical 
weight. This second point is analogous to what happens 
in Eq. (0.1) where there is a path connecting the two 



wells but the path has low amplitude. What we would 
really like is a case where we can connect any trajec- 
tory to any other by a sequence of trajectories with only 
slightly smaller statistical weight, but for which diffusion 
is still slow because it requires a very long sequence of 
trajectories to get from one to the other. In the third 
example, we provide this, and after giving this example 
we then modify it to construct a problem on which the 
QMC annealing protocol will likely fail. 



A. First Example 

In a first example, consider a surface embedded in 
three-dimensions that looks like a dumbbell, with a nar- 
row neck near the middle. This surface is intended to 
have the topology of a sphere, and so 7Ti is trivial. As 
the neck pinches off, the Cheeger constantPSl goes to zero, 
but we will not need to take the neck that narrow. Let 
us suppose that the neck has width ~ 1, while two halves 
of the dumbbell each have linear size ~ L and area ~ L 2 . 
Then, the Cheeger constant is ~ l/L 2 . We will take L 
only polynomially large. Now, imagine a path that takes 
an imaginary time /3 to wind around the neck (3 times. 
This path can be shrunk to a point, by pulling it off the 
neck, but because of the large number of windings around 
the neck, changing the number of times it winds around 
the neck is exponentially suppressed in L. 

While this example allows us to have a polynomially 
small gap for a particle, but have an initial path that 
takes an exponential time to relax to equilibrium, this is 
slightly unsatisfactory, because the initial path is expo- 
nentially unlikely to occur in equilibrium, having a very 
small statistical weight. 



B. Second Example: Group Presentation and 
Sequence of Trajectories 

We fix this problem in the second example, which is 
based on group theory. Recall the concept of a presen- 
tation of a group. This is defined by certain generators, 
gi, ... and certain relations, ri, where a relation spec- 
ifies that a certain product of the generators is equal to 
the identity. For any presentation of the group, we can 
define a 2-complex, called the "presentation complex" , 
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whose fundamental group is the group specified by that 
presentation^. This complex is constructed as follows. 
There is one 0-cell. For each generator, we attach a 1-cell, 
giving a bouquet of circles, and then for every relation 
we attach a 2-cell whose boundary is attached to the 
1-cells corresponding to the generators in that relation. 
While this construction gives a complex, we can embed 
the complex without self-intersection in five dimensions 
and then construct a four manifold with the same funda- 
mental group. 

Note that previously, when we considered the bouquet 
of circles, we had some number M of discrete states on 
each circle. We can imagine doing something similar 
here, giving a finer subdivision of the complex. Then 
a trajectory in the QMC simulation is some closed path 
on the 1-skeleton of this subdivided complex (we use the 
term "path" rather than "trajectory" here for consistency 
with terminology in graph theory and to emphasize that 
the path is on some complex that we have defined from 
a group presentation, but we use "trajectory" later when 
giving an analysis of a QMC algorithm for a system in- 
cluding both such a complex and some additional states). 

Moving the path by local updates corresponds to us- 
ing 2-cells to change this path. However, we can avoid 
explicitly doing this subdivision and instead just speak 
of a path as a word, where a word is a sequence of gen- 
erators. To do this, note that we can deform any path in 
the subdivided complex onto the 1-skeleton of the orig- 
inal complex, by deforming it within each 2-cell of the 
original complex. We can arbitrarily decide some way to 
do this for each 2-cell of the original complex. Then, us- 
ing a sequence of local updates changes the undeformed 
path, and may potentially change the deformed path. 
This change in the deformed path corresponds simply to 
some relation (or its inverse). So, for simplicity, we speak 
of a path as a word in the generators. Local updates cor- 
respond to using a relation, or its inverse, or cancelling a 
generator against its inverse (so g a g^ 1 can be cancelled) 
or the inverse of this cancellation. 

It is knowrp2122l that it is undecidable whether or not 
a finite group presentation is trivial. Note that a group 
presentation will be trivial if and only if for every gen- 
erator g a , we can take a path that winds exactly once 
around the 1-cell corresponding to that generator and 
deform that path to the identity. If there were some suf- 
ficiently slowly growing bound (indeed a bound by any 
computable function) on how long this sequence of paths 
might be, then we could decide whether or not the group 
presentation was trivial, simply by trying all sequences 
shorter than a certain length. Thus, the length of these 
paths must increase very rapidly for certain presentations 
of the trivial group. Note that this example improves on 
the previous example, in that it might take very long for 
a path that simply winds once around a given 1-cell to 
turn into the identity, and this path winding once around 
a given 1-cell has non- negligible statistical weight, as op- 
posed to the previous case where the path that wound 
many times around the neck of the dumbbell had very 



small statistical weight. 

So, this gives us a sequence of examples where the 
diffusion of the paths becomes much slower than the dif- 
fusion of particles, despite having trivial ~k\. There is 
still one unsatisfactory aspect of this example. Namely, 
the sequence of paths to deform a given generator to the 
identity might involve increasing the length of the word 
by a large amount. That is, the sequence of paths will be 
of the form W\, W2, icjc where w± is one of the genera- 
tors, wk is the identity, and W2, wjsr-i are some other 
words in the group, and relations arc used to move from 
Wi to u>j+i, and where not only will K become large, but 
possibly also the length of some of the Wi will also be- 
come large. As this length becomes large, the statistical 
weight of the corresponding path Wi goes to zero. This 
does of course even further slow the diffusion of paths, 
but we might be interested in an example where this 
weight does not go to zero as rapidly. 

So an interesting question is whether there is a se- 
quence of presentations of the trivial group for which we 
can find a sequence of paths from every generator to the 
identity, and for which the minimum length K of such a 
sequence diverges rapidly, but for which every word Wi 
in that sequence has at most polynomial length. Note 
that in this case, K is bounded by an exponential of a 
polynomial, because there are only that many possible 
words of polynomial length. 



C. Third Example 

Consider the group presentation with generators 
gi,...,g n and relations gig^ 2 , fi^ 2 , g n -i9n 2 , 9n- 
This is a presentation of the trivial group, as follows. 
The last relation is the generator g n . We can multiply 
the second-to-last relation by g\ to get the relation g n -i 
and then multiply the third-to-last relation by g^-i to 
get the relation g n -2, and so on. 

Further, given any word w, we can turn that word into 
the trivial word by a sequence of words using the relations 
without any word in that sequence having length more 
than n longer than the original word. Let us illustrate 
how to do this for the word w — g\. Then consider the 
following sequence of words: 

si, gl, 5253, 5233ffi -, g2---g n -igl, g2-.g-n.-1, 
g2--g n -2g 2 n , g2—g n -2, 92—gn-afffl-i, g2-g n -zgn-\g 2 n , 

and so on. At no point in this sequence does the length 
of the word get longer than n. The algorithm to generate 
this sequence starting from g\ is always to take the last 
generator in the sequence and if it is g n then remove that 
generator, but if it is some other generator g a to replace 
it with g 2 +1 . We can use the same algorithm for any 
starting generator g a . Then, for any given starting word 
that is not a single generator, we can apply this sequence 
to turn the last generator in the original word into the 
identity, then the second-to-last, and so on. 
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FIG. 4.1: The Desperado puzzle. The puzzle rests on a flat 
plane, shown. The objective is to separate the string from 
the wood pillars. A "topologist's solution" to the puzzle is to 
deform the pillars instead of the string, which makes it appar- 
ent that the string is in a topologically trivial configuration. 
However it requires an exponential time to move the string 
between different configurations. 



The above sequence takes an exponential number of 
moves to turn g\ into the identity. We can prove that 
any sequence requires an exponential number of moves 
as follows. Consider any word written as 9a\9al---9aZ,^ 
for some integer m. Define the weight of the word to be 



X>(2 



n— 



!)• 



(4.1) 



This weight is invariant under conjugating the word by 
any generator, or by inserting a generator and its inverse 
anywhere, as one adds generators to the word with both 
positive and negative powers when doing this and these 
terms cancel in the weight. Using a relation can change 
the weight by ±1. The word g a has a weight 2"~ a+1 — 1, 
so it takes at least 2™~ a+1 — 1 moves to turn that word 
into the identity. Note also that even if we add in the 
relations g a 9b9a X % X ^ specifying that all generators com- 
mute, we still have the same lower bound on the number 
of moves. 

Interestingly, this group presentation was inspired by 
considering a puzzle made of woo d and string, called the 
Desperado puzzle^. See Fig. 4.1 A similar effect occurs 
there, though that puzzle involves a space that is not 
simply connected. 

Now, having constructed this group presentation, we 
construct an example where QMC will fail to find the 
ground state. This is similar to the "third example" pre- 



viously using a bouquet of circles in subsection 2 C but 
here we have a trivial fundamental group. Let C be the 
complex constructed above corresponding to this group 
presentation. Let C be a copy of C. We attach the two 
complexes to each other at a point by identifying the 0- 
ccll in C with that in C . This corresponds to a group 
presentation with generators gi, g n 
tions 



. . . , g' n and rela- 



2 > 929 3 2 , ...,g„_i5„ 2 ,g„, 



g'lig'z) 2 , 92(93) 2 »->ffn-i(fn) 2 >s«- 

We can also add in relations gagb9a 1 9b 1 ana - 



5a5h(ffa) _1 (g'b)" 1 ^ we choose, but we will not add in 
any relation gaU^Sa \9b) • Having defined this complex, 
we then subdivide the complexes and define a quantum 
Hamiltonian H comp i ex . Each basis state that H complex 
acts on corresponds to some 0-cell in the subdivided com- 
plex. Let |0) be the 0-cell in the subdivision that corre- 
sponds to the 0-cell in the original complex. Then, add 
in an additional state \r) and consider the Hamiltonian 



H = mH bouquet - h\0}(0\ 
-t\0)(r\ +h.c. 
+E\r)(r\. 



(4.2) 



We follow the same annealing protocol as in the "third 
example" with the bouquet of circles, though the spe- 
cific values we pick for h, t, E will be different. Consider 
what happens for the QMC protocol. Initially, we have 
Cj = r for all i, corresponding to the state |r). How- 
ever, once t becomes non-zero, the trajectory starts to 
spend time on the bouquet. Suppose at some pair of 
time slices i,j, we have Cj = Cj — r, but for all times 
k e {i + l,i + 2, — 1} we have 7^ r. Then, the 
sequence of configurations c^+i, ...,Cj_i is some sequence 
that starts and ends at |0) and forms a path. This path 
corresponds to a word in the generators. Let us write 
this word as w = wiw' 1 W2W / 2... where w a is a word in 
the generators gi , . . . , g n and w' a is a word in the gener- 
ators g[, <?^. Now, every path is topologically trivial, 
because the presentation describes a trivial group. How- 
ever, certain paths, namely those in which any of the w a 
or w' a have exponentially high weight cannot occur with- 
out taking an exponential length of time in the QMC. 
Arbitrarily, let us say that the weight is high if it is more 
than 2 n / 2 . The number of words of length I can be very 
crudely estimated as (4n) , since each generator in the 
word can be any of the generators g a or g' a or any of 
the inverses of these generators. Taking into account the 
ability to reduce a word by cancelling a generator against 
its inverse (say, gag^ 1 ) slightly reduces the base of the 
exponent, by an amount which is o(n). However, if we 
restrict to words in which no w a or w' a has weight more 
than 2™/ 2 appears, then the number of such words is c l 
for some c < An (in fact, c/An converges to some num- 
ber less than 1 in the limit of large n). So there are 
exponentially fewer such words. 

So, by restricting to these configurations where each 
w a or w' a does not have high weight exponentially reduces 
the statistical weight of the sum over paths corresponding 
to words w of length I. Effectively, this means that the 
QMC algorithm sees that the spectrum of states in the 
complex does not start at zero energy (the true ground 
state energy of the Laplacian on the complex) but in 
fact starts at some higher energy. So, similar to before, 
we can find an annealing protocol for which the energy 
E is greater than zero (so that the quantum annealing 
procedure finds the ground state on the complex) but the 
energy E is less than the bottom of the spectrum that the 
QMC sees (so that the QMC algorithm instead produces 
the state \r) at the end of the annealing protocol). 
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We can also take this example and do something simi- 
lar to what we did in the "fourth example" previously in 
subsection |2 D[ and add an expander graph in addition to 
the state \r) to construct an example which has a trivial 
fundamental group for which QMC fails even with open 
boundary conditions. 



Then we can write H = J^i^i^ where [Hi,Hj] = for 
— > I- Note that Hi and i?i+2 both have support on 
the i + 1-th time slice, but they still commute. 

A. Lower Bounds on Equilibration Time 



5. ANALYTIC RESULTS ON EQUILIBRATION 

We now provide some analytic results on equilibration, 
including both positive and negative results. Consider a 
continuous time Markov dynamics with transition rates 
from state c to state d given by: 



Mi ^ T dc = J dc exp(-(E d -E c )/2), 
where J is a symmetric matrix. Then, 



T 



d^c 



J dc eM-{E d -E c )/2). 



(5.1) 



(5.2) 



These rates satisfy detailed balance with stationary dis- 
tribution P c cx exp(— E c ). 

To analyze equilibration, we make a non-unitary 
change of basis to transform T to a symmetric matrix L 
by right-multiplying by exp(— E c /2) and left-multiplying 
by the inverse matrix. The resulting matrix L has matrix 
elements 

c^d -> L dc = exp{E d /2)T dc exp{-E c /2) (5.3) 

= Jdc- 

This matrix is symmetric and real and has the same 
eigenvalues as T. Write H = —L. Then, H has at least 
one zero eigenvalue, and a bound on the second eigen- 
value gives an upper bound on equilibration time (the 
equilibration time will be bounded by the inverse of this 
eigenvalue times the logarithm of the size of this matrix, 
and for our purposes this logarithm grows only polyno- 
mially in K, N and j3). Note then that in this section, 
we are using H to refer to a Hamiltonian defined from 
H = —L. Previously we used H to refer to a quan- 
tum Hamiltonian. To avoid ambiguity, in this section we 
will write Jji uantum to refer to the quantum Hamiltonian 
considered previously. 

For local Monte Carlo moves for our problem with sta- 
tistical weight (1.3), we write the statistical weight as an 



exponential of an energy (for our particular choice of sta- 
tistical weight, some energies might be infinite as some 
transitions are forbidden but in this section we will sim- 
ply replace those with very large energies). Then, we 
can write H as a Hamiltonian for a one-dimensional spin 
chain, with K spins, one spin per time slice. We write Cj 
for the value of the configuration on the i-th time slice. 
We take J = J,, where Jj is some symmetric matrix 
which is supported on time slices i— 1, i, i+1 and does not 
change the value of Cj_i or Cj+i. Let Jj(c^, Ci\ci-i, Cj+i) 
denote matrix elements of Ji from c, ; to for given val- 
ues of Cj_i, Cj+i. We write E as E = J2i B, i j_|_i(cj, i + 1). 



We write an orthonormal basis of states as \c\, ck)- 
Consider this orthonormal set of states: 



i ci) = £ z{ C1 )-vi^(- E ^-- CK) y±) 



C2,C3,...,C K 

x|ci, ...,c K ) 



where E(a, ...,c K ) = Y*Li E i,i+i{ c h <k+i) and 

Z( Cl )= ]T expMfo, ...,«*)). (5.5) 



C2,C3,...,CK 



Note that c\ is not summed over in either equation. 
Then for c[ ^ ci, 

(c[\H\ Cl ) (5.6) 

c 2 ,c 3 ,...,c K 

( E{d,...,c K ) + E(c' 1 ,...,c K ) \ 

X GXp I ~~ 1 



-J(ci,ci). 



Also, 



(ci\H\ci) = ^2 ^( c i) c i 




(5.7) 



Let H be the operator H projected into the space of 
states |ci), \ck), so the abo ve equations define the ma- 
trix elements of H. Eq. (5.7) shows that H can still be 



regarded as arising from equilibration of some Markov 
dynamics with an effective J(c' l7 c\) playing the role of J 
and with — log(Z(ci) playing the role of the energy E. 

Note that the Markov process is defined by an energy 
E and a matrix J. If we have two different Markov pro- 
cesses, both using the same energy E but one using a 
matrix J and the second using a matrix J' such that ev- 
ery matrix element of J' is greater than or equal to the 
corresponding matrix element of J, the corresponding 
Hamiltonians obey the inequality that H' > H . Above, 
we have an H which is the Hamiltonian corresponding 
to a Markov dynamics with energy log(Z(ci) and matrix 
J. Let us use D to denote the matrix with the same 
off-diagonal matrix elements as u quantum but which is 
zero on the diagonal. Define a matrix J' = —CD where 
C is the smallest constant such that every entry of J' is 
greater than or equal to the corresponding entry of J. 

Let E be the ground state energy of u quantum . We 
now show that in the limit of large /?, 



H' = c(H quantum - E Q ) + 0(exp(-/3A)d 



(5.8) 
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where N is the number of sites and d is the dimension 
on a single site. By construction, the off-diagonal matrix 
elements of H' are the same as the off-diagonal matrix 
elements of H qwantum , up to multiplication by a factor 
of C. Now consider the diagonal matrix elements. Let 
tpo(c) be the ground state wavefunction. In the large /3 
limit, Z(ci) = |i/- (ci)| 2 + O(exp(-0A)d N ). This error 
term 0(...) becomes negligible for f3 = poly(iV). In many 
cases, this error bound 0(exp(— f3A)d N ) is in fact a large 
over-estimate of the true error. Then, we find that 

(V>(c)|#'Mc)) (5.9) 

= -CY(^(c')\H quantum \^{c))'^fl 

+0(exp(-f3A)d N ) 
= C{(i,{c)\H quantum \^{c)) - E ) + 0(exp(-^A)rf A '), 

where we used the fact that tpo(c) is an eigenstate of 
H quantum , so that Y, C '^c(' l l ; ( c ')\ HquantUm \^( c )}Mc') = 
(E - {tp(c)\H guantum \tp(c)))i(}o(c). This shows Eq. $5$ ). 

So, the lowest eigenvalue of H' equals CA. Since H' 
upper bounds H, this gives an upper bound to the lowest 
eigenvalue of H. Note that the matrix elements of J are 
upper bounded by the corresponding matrix elements of 
J. So, C is upper bounded by the smallest constant c 
such that the matrix elements of — cD are greater than 
or equal to the corresponding matrix elements of J. For a 
natural choice of local updates in which we update from 
state c to d if there is a term in the Hamiltonian connect- 
ing those two states, this constant c will be of order unity. 
So, in this small eigenvalue of H quantum implies 

a small eigenvalue of H and hence a slow relaxation. 

This result is perhaps not surprising, though, as for 
some values of ft we can also derive this result using expo- 
nential decay of correlations. If Hi uantum has a small gap 
A compared to /?, then the equilibrium state has long- 
range correlation in imaginary time. Using our interpre- 
tation of H as a Hamiltonian for a spin chain, this corre- 
sponds to a long-range correlation in the spin chain. Us- 
ing the exponential decay of correlations^ for a gapped 
H, we can bound the gap of H. However, the explicit re- 
lation between H' and JJ quantum here may be interesting. 
Further, if A is small but /3A is large, then there may not 
be long-range correlations in imaginary time for diagonal 
operators. Consider a simple Hamiltonian on a single 
spin coupled to a magnetic field h: H quantum = hS z , 
with gap A = h > 0. Then, for (3 » A, the QMC prob- 
ability distribution is dominated by the trajectory with 
the spin pointing down in all time slices so there are in 
fact no long-range correlations. 



B. Lower Bounds on Eigenvalue 

We need some preliminaries first. We first derive some 
lower bounds for eigenvalues of one-dimensional quantum 
spin chains in Eq. (5.21), which can be understood as 



some sort of renormalization procedure. We then apply 
to the specific spin chain arising from the QMC dynam- 
ics. 

We begin with the following result: let Pi, Pa be pro- 
jectors. Then, for any real number x with < x < 1 we 
have: 



xPi +P 2 > 



1 



-(1-P 1 )P 2 (1-P 1 ). 



(5.10) 



To prove Eq. (5.10), note that by Jordan's lemma we 



can find a basis such that both Pi , P 2 become block di- 
agonal, with the blocks having size either one or two. We 
prove this equation for each block. In a block of size one, 
then Pi,p2 in that block equal or 1, giving 4 different 
possibilities for that block. One can explicitly check all 
four possibilities. Now consider a block of size two. We 
can write P\ in this block as 



1 




and Pi in this block as 



Let y 



cos 2 (6») cos(0)sin(0) 
cos(0)sin(0) sin 2 (0) 

So, we need to check that 



x + cos 2 (0) cos(0)sin(0) 
cos(0)sin(0) (l-y)sin 2 (0) 



> 0. 



(5.11) 



(5.12) 



(5.13) 



This matrix is Hermitian. For the given choice x, it has 
a positive trace. So, it suffices to check that the determi- 
nant is positive. The determinant equals 



(i + cos 2 (0))(l-y)-cos 2 (0) sin 2 (0) 



(5.14) 



Note that sin 2 (0) > 0. The quantity in brackets is equal 
to x(l — y) — ycos 2 (0). Since cos 2 (0) < 1, this quantity 
in brackets is greater than or equal to x(l — y) — y, which 
for the giv en cho ice of y is equal to 0. 
For Eq. (15. 101), it follows that 



Pi + P 2 > aPi 



1 



2-a 

for all < a < 1 or that 

1 a 1 — a 

^Pi+P 2 > ~P+~ 

I 2 6 — a 



l-Pi)P 2 (l-Pi), (5.15) 



(1 - P X )P 2 (1 - Pl). (5.16) 



Consider a one-dimensional Hamiltonian of K sites for 
some K, 



H = P 1 +P 2 + ... + P K , 
where each Pj is a projector and where 



0. 



(5.17) 



(5.18) 



Assume for simplicity that K is even. We identity the 
K-ih and the 0-th sites and the distance in the above 
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equation should be taken with this periodic identification. where 



Note that one way for Eq. ( 5.18 1 to hold is if Pi acts only 
on the i-th and i + 1-th site. However, later we will 
consider a more general way in which this equation can 
hold. 
Then, 



H = 



i=0,2,. 



1 



1 



:P 



i+2 



(5.19) 



where the sum is over all even i less than K. Write 
Qi = 1 - Pi. Note that \P t + \P l+2 > \{l - Q l Q i+2 ) 
and also (1 — QiQi +2) > \ {Pi + Pj+2)- (Th is is the place 
where we use Eq. (|5.18[)). So, by Eq. (|5 16[), 



(5.20) 



-Pi + P i+ i + -jPi+2 

> ^(1- QiQi+2) +Pi+l 
(X 1 — Ct 

7t(1 — QiQi+2) + 7, Qi,i+lQi+2Pi+lQi+2Qi 

2 A — a 

> ^(Pi + p i+2 ) + L 



-QiQi+2Pi+lQi+2Qi 



So, 



where 



and 



_ ^ even 



l - a jr 
7, tLodd, 



Pleven ^ ^ Pi 



(5.21) 



(5.22) 



Hodd — ^ QiQi+2Pi+\Qi+2Qi- (5.23) 

i=0,2,... 

Note that H even and H dd commute with all Qi. 

We now apply these results to equilibration. Let Ao 
be minimum over i of the smallest non-zero eigenvalue of 
Hi. This quantity \ characterizes how quickly a time- 
slice can equilibrate to its neighboring time slices. So, 
H > Ao Yli Pi j where Pi = 1 — Qi and Qi projects onto 
the zero eigenspace of Hi. The operator Pi is supported 
on sites i — 1, ...,£ + 1 but it does not change the value of 
Ci_i or Cj+i. 

Let us assume that for any given c,_i and Cj+i that P, 
has a unique ground state on sites + Then we 

can write a basis for the eigenspace in which all even Qi 
are equal to 1 by states of the form 

(f>2(ci,C2,C 3 )<f>4,(c3,C4 ) C 5 )...\ci ) C2,... ) C K ), (5.24) 

c 2 ,c 4 „... 

Note that ci, C3, ... are not summed over and there is ex- 
actly one such eigenstate per choice of ci, C3, .... Here we 
have defined 



0i(Cj_i,Ci,Ci + i) 



exp 



Ei- li i(ci-i,c i )+Ei t i +1 (ci,Ci + i) 



Zi(ci-i,Ci +1 ) (5.25) 
^ exp(-(-Ei_i,i(ci_i,Ci) + S i)i+1 (ci, Cj+i))). 



We write the state in Eq. (5.24) as |ci,C3, ...) in a slight 
abuse of notation (if both odd and even appear in a 
ket then it is a state of the whole system, but if only odd 



Ci appear in the ket then it is a state of form ( 5.24 )) . 

Now compute H oc id for this spin chain. The operator 
QiQi+2Pi+\Qi+2Qi is supported on sites i— + 
3. We evaluate its matrix element between two states 
\c%,C3,...) and \c[,c' 3 , ...) that agree on all sites except 
site i + Since this matrix element only depends upon 
c,_i, Cj+i, Cj+3, we will not write any other c,-. For 
notational simplicity, let us fix i = 2. Then for ci 
we have 



(ci,c 3 ,c 5 |P i+1 |ci,c 3 ,c 5 ) (5.26) 

- [Z 2 {ci, C 3 )Z4(c 3 , C 5 )J (#2(C1, C 3 )^4(c 3 , C5)] 

X X] J ( C 3: C 3|C2,C 4 ) 

C2.C4 

f /" c , \ E 2 , 3 (c 2 ,c 3 ) + E 2 , 3 {c2,c 3 )\ 
x |exp^-£?i )2 (ci, c 2 ) — ^ — -J 

/ rp ( \ ^3,4(4^4) +^3,4(C 3 , C 4 )\\ 

x exp ^-^4,5(04, c 5 ) J j 

-J(c 3) c 3 |ci,c 5 ). 



Also 



(ci,C3,C 5 |P 4+ i|ci,C3,C 5 ) 



(5.27) 



X] ^( c 3> c 3|ci,C 5 )l 



, Z 2 (ci,c^)Z 4 (c 3 ,c 5 ) 

Z2(ci,C 3 )Zi(c3,C 5 ) 



Z l (c l ^ 1 ,c l+1 ) 1 / 2 



This procedure can be regarded as a kind of renor- 
malization procedure. We have a new H oc id, which 
acts on a spin chain with half as many sites. This 
Hodd can still be regarded as arising from equilibra- 
tion of some Markov dynamics with a "renormalized" 
J(c 3 , c 3 |ci, c 5 ) playing the role played by J(c 3 , c 3 |c 2 , c 4 ) 
and with — log(Z 2 (ci, c 3 )) playing the role of the term 
in the energy which depends upon a pair of neighboring 
sites. 

The calculation in this subsection is very similar to the 
one in the previous subsection, in that in both cases we 
calculated a renormalized J from the original J, although 
here we use it to give an upper bond and in t he pr evious 
case we used it to give a lower bound. Eq. (5.21) lower 
bounds H in terms of H even and H a( id- We can iterate 
Eq. (5.21). That is, we can apply this equation to the 
Hamiltonian H oc id and so on defining a renormalization 
procedure that halves the number of sites at each step. 
The constant factor leads to an exponential decrease 
in the Hamiltonian from one step to the next. We can 
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pick a small value of a to make this constant close to 
1/3. Since the number of renormalization steps is only 
logarithmic in K, this constant factor produces only a 
polynomial decrease in H. 

So, we can lower bound the lower non-zero eigenvalue 
of H by the polynomial factor from {^E^Y 062 ^, mul- 
tiplied by the product of Ao over steps. The bond be- 
comes ineffective if this Ao becomes small. To give an 
example of how Ao can become small, consider the fol- 
lowing toy model. We have configurations c labelling 
angles on a circle, so < c < 2ir. Suppose that the 
statistical weight vanishes if \a — Cj+i| > 0.26-7T for any 
i and is equal to 1 otherwise. Then, a possible trajec- 
tory is c\ = 0, C2 = 7r/4, C3 = 7r/2, C4 = 'Sir/ 4, C5 = n, .... 
Another possible trajectory is c\ = 0,C2 = — 7r/4, C3 = 
— 7r/2, C4 = — 37r/4, C5 = 7r, .... Both trajectories have the 
same ci and C5, but if those c\ , C5 are held fixed then local 
updates cannot move from one trajectory to the other. 
In this case, we find that J' has no matrix elements be- 
tween C3 — n/2 and C3 = —it/ 2 because if we choose a 
value of C2 such as 7r/4 which is consistent with C3 = 7r/2, 
then it is inconsistent with c 3 = —tt/2. 

Note also that it may not be necessary to run the renor- 
malization until log(i£") steps, if K is sufficiently large 
compared to j3. After a large number of steps, the Hamil- 
tonian may approximately "decouple" H into a sum of 
single site Hamiltonians, as the statistical weight will be 
a product of |^o(ci)| over sites that remain. 

6. DISCUSSION 

The work in this paper attacks the question of the com- 
putational complexity of the adiabatic algorithm with no 
sign problem. Without the "no sign problem" restriction, 
Ref. [7] shows that the adiabatic algorithm is equivalent 
to the circuit model. With this restriction, a natural 
conjecture is that the adiabatic algorithm can only solve 
problems in the complexity class BPP. While we have 



no definite results on the complexity, we have shown 
that the simplest way to place the adiabatic algorithm 
in BPP, by path integral QMC with local updates, does 
not work. We have shown that it is possible to have a 
path in parameter space of quantum systems with a spec- 
tral gap that is only polynomially small and which have 
no sign problem, but for which QMC has exponentially 
slow equilibration for the natural choice of annealing pro- 
tocol. While the existence of obstructions to equilibrat- 
ing QMC based on a nontrivial fundamental group are 
well-known, for example when studying bosons moving 
on a torus, and much effort has been devoted to nonlocal 
updates which might alleviate these problems, we have 
shown much stronger effects using a fundamental group 
which is a free group on two or more generators. These 
stronger effects prevent QMC from accurately calculating 
the ground state energy, even at large /3. 

Perhaps more surprisingly, we have shown that slow 
equilibration of QMC can happen even if the fundamen- 
tal group is trivial. These examples are still based on re- 
sults in topology, though, as they exploit the connection 
between a group presentation and a simplicial complex. 

Finally, we have provided some analytic results con- 
necting the spectral gap of the quantum Hamiltonian 
to the relaxation. Interestingly, this implies that our 
third example in section [4] gives a Markov dynamics 
whose corresponding Hamiltonian (that is, the Hamil- 
tonian defined from the Markov dynamics, rather than 
jjquantum^ ^as an exponentially small spectral gap but 
has only short-range correlations for far separated spins, 
because far separated spins correspond to very different 
imaginary times. 
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Appendix. Recursive group presentations from low dimensional topology 



by M. H. Preedman 



The purpose of this short appendix is to give some geometric/topological context to the phenomenon 
(see Section 4 of the main paper) of group presentations which, in geometric language, contain short 
contractible loops 7 which bound only exponentially large area disks A. Our examples all have the 
property that while area A ~ e len s th (7) j /\ j s "thin" in that 7 can be swept over A with only a linear 
increase in length. This thinness property appears to be quite generic: it is the basis for many "string 
puzzles"^ and may have some yet unexploited implications in topology. 



Appendix A: Solenoid 



Let us start with the dyadic solenoid and compare with the trivial group presentation of Section 4C: 

I 5i = 9l,---,9n-i =dl,g n } (A.l) 
The solenoid X is the continua X = l~)^ Si, where each Si is a solid torus and Si + i is embedded (with 



no normal twists) in Si, as shown in Figure A.l 



FIG. A.l: 

We are actually concerned with the finite stages X n = S n C S 1 x B 2 = So C R 3 C S 3 , where S 3 
denotes the 3-sphere = ]R 3 U 00. For example, X 3 wraps eight times through the meridian 7 of Si as 
shown in Figure |A~2} 

While 7Ti(S' 3 \(7 U X n )) is an extremely complicated nonabelian group (It is an amalgamated free 
product of n — 1 copies of ni(Si — S^+i), which can be computed by the Wertinger algorithm as: 

{xi, x 2 ,Vu 2/2, 2/3 I yi 1 X2 1 yiXi,y3 1 x~ 1 y 3 x 2 ,y 3 x~ 1 y 1 x 1 ,y^ 1 y^ 1 y 3 y 1 }), (A.2) 

the fundamental group of (S 3 \X n ) = Z, the integers, since X n C M 3 is an unknotted solid torus. This Z 



is precisely the same as the presentation (A.l I with the final relation g n omitted. Now attach a 2-handle 
(B 2 x /, dB 2 x /) to the meridian of S n (corresponding to the relation g n ). This partially fills the toroidal 
"hole" so the "hole" is now just a 3-ball B, and Y := (S 3 \X n ) U 2-handle is the complement S 3 \B, also 



a 3-ball. In particular, tti(Y) = {e} with presentation (A.l). 

We are not required to use the metric from S 3 . Let us instead take all units (Si — Si+i) isometric with 
dSi = dS i+ i = S^ nit x S^ nit and the final S n = S^ nit x B 2 nit , products of unit circles and disks in the 
Euclidean plane. Thus, any surface (S,i9) C (S n ,dS n ) representing the generator S of H2(S ni dS n ; Z) 
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FIG. A.2: 



has area > tt. (Since the composition E «-> 5 n = x -Dunit F ~~^ ^unit i s locally area non-decreasing.) 
Similarly, any surface representing kS must have area > kn. 

Evidently, the linking number L(j,S n ) — 2". So for homological reasons, any disk A (or even any 
oriented surface) bounding 7 must contain a (possibly disconnected) subsurface representing 2 n S, and 
hence area(A) > 2 n ir. 

On the other hand, the obvious planar disk A bounding 7 and cutting through S n in 2™ meridional 
disks Si, 1 < i < 2™, can be deformed to A' by sliding each Si along S n until it enters the 2-handle 
spanning a meridian to S n , to lie in Y. Metrically A' has 2™ "thumbs" of area > 7r each and height < n. 




FIG. A.3: 



The thinness property originally deduced from the presentation ( A.l ) can be understood geometrically: 
although A' has exponentially many "thumbs" of size O(l), we may avoid stretching 7 (more than 
linearly) by passing it over the "thumbs" one at a time. 



Appendix B: Half gropes and Devil's staircase 



Here we describe a 2-complex, sometimes called a half grope, which is at the heart of the "Desperado" 
or "Devil's ladder" string puzzles. We consider only genus one examples: Figure [BTi] shows a half grope 
H of height n = 4 with a possible cap disk indicated with dotted lines. 

The building block is a punctured torus. If we take the puncture to be the size and shape of a 
longitude circle we may glue n copies together as shown to produce H n . is the half grope union 
a final disk bounding the topmost longitude. Using [a, b] to represent bab~ 1 a~ 1 and simply integers to 
denote generators, we may present tt\{H^) and ■K\{Hf) as follows. 



TTi (G 4 ) = {1,2,3,4,5,6,7,8 I 1 = [3, 4], 3= [5, 6], 5= [7, 8]}, and 



(B.l) 
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FIG. B.l: 



tt(G+) = {1, 2, 3, 4, 5, 6, 7, 8 | 1 = [3, 4], 3 = [5, 6], 5 = [7, 8], 7 = e} (B.2) 

and similarly for all H n and if+ . 

The loop 7 = [1,2] and the relations tell us immediately that 7 lies in the n-stage of the lower central 
series of iri(H n ) = Free(l, 2, 3, 4, ... , 2n), where we consider ordinary commutators to be in stage 1 of 
the 1. c. s. On the other hand, 7 = 66 717 (iJ+) = Free(2, 4, 6, ... , 2n). To see that 717 (if n ) and 7r 1 (7J+) 
are free, observe that they collapse to one-dimensional graphs, e.g., a punctured torus H\ collapses to a 
wedge of two circles. 

Proposition B.l. Any map f of a disk bounding 7 into H+ must pass over the cap at least 2™ times, 
i.e., if p is the origin of the cap, / _1 (p) must consist of at least 2™ points, which we may assume to be 
transverse. 

We need: 

Lemma B.2. Setting 7 = dH+ and f : (D 2 ,d) -> (H + ,~/), f 1-1 on d, then > 2. 

Proof. It is readily computed (by a Mayer- Vietoris sequence) that H^iH^ , 7; Z) = Z and that d : 
H2(Hx , 7; Z) — > 1(7; Z) is an isomorphism. Consequently, any two null homologies of 7 are themselves 
homologous (up to sign): \f(D 2 )] = ±[i?i] € Hi{H^ , 7; Z). Since Hi is disjoint from p, the homological 
intersection number tK/(-D 2 ),p) = \H\ flf>| = 0. But 7 is homotopically essential in Hi, so |/' _1 (p)| > 0, 
for any /' homotopic to /, for if /' misses p it may be deformed into H±. Since the signed sum of inverse 
images for /' generic is 0, \f~ 1 {p)\ > 2. □ 

For our induction we actually require a slightly stronger: 

Lemma B.3. Let f : (P,dP) — > [H^ ,7) be a map of a compact planar domain inducing degree = ±1 
on d, f*[dP] = ±1 G Hx(r,Z) = Z, then \f- l {p)\ > 2. 

Proof. The only new point is to show that the image f(p) cannot lie in the punctured torus, Hi. If / 
did factor through Hi then / : (P,dB) — > (Hi,d) is a degree one map. Let a, (3 be the dual meridian 
and longitude loops on Hi, respectively, and let a and b be their transverse inverse images a = f^ 1 (a), 
b = / _1 (/3). Applying the degree(/) = 1 property to the single transverse intersection x — a n f3 we see 
that |/ _1 (a;)| = intersection number (a, b) = 1, contradicting the planarity of P. □ 



Apply B.3 to Hk with all higher stages pinched to a disk to form H%, starting with k = 1,2, 



Corresponding to the (at least) two points of opposite sign comprising f~ 1 {p), p € cap D 2 of Hi will 
contain (at least) two disjoint planar domains P + and P_ mapping with opposite orientation over the 
cap of Hi . These two planar domains can now be regarded as mapping into H% \Hi), degree one on the 
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boundary of the second stage. B.3 now identifies further planar subdomains P++, P-\ C P+ and , 

P__ C P— mapping with opposite signs over the cap of ■ By induction we obtain 2" disjoint planar 
domains P n . s tring C D, each mapping over the final cap of . The orientation of each mapping is the 
w eight of the string. This proves |B.f | □ 
implies any disk in H£ bounding 7 has exponential area :: 2™. Because the tti(H+) is nonabelian, 



B.l 



an algebraic — weight base argument — for this area estimate is not easy. 

However, the proof of thinness for a suitably proven null homotopy of 7 is easily given in the algebraic 
context. We simply illustrate the initial steps for shrinking 7 in with only a linear increase of its 
length: 

7__-> _ 1212__-> 3434212 -> 5656434212 ->_ 7878656434212_ -> 878656434212_ 656434212 -> 
678786434212 68786434212 434212 456564212 -> 478786564212 48786564212 46564212 
46787864212 4687864212 ->■ 212 234342 256564342 -!• etc., where we always use a relation to 
increase the leftmost odd letter until it reaches 7 (= 2n — 1) and may be canceled. 

The procedure above is identical to the YouTube video^S showing how to solve Puzzle Master "Des- 
perado." 

The essential features of the problem are still present in a simplified picture where the ambient fun- 
damental group is only Z: Consider a slab in K 3 with an unknotted but geometrically interesting arc a 



joining top to bottom. We draw a below (Figure B.2) so that the loop 7, also illustrated, bounds an 



embedded in the complement of a. (Find it!) By a mild extension of the arguments used to prove 



Proposition B.l it may also be proved that the area of the smallest disk A C slab\a with dA = 7 



also grows exponentially with the number n of self- feeding stages (n = 4 in Figure B.2). Of course the 
presence of if„ confirms that 7 can be homotoped to a point so that its length increases only linearly 
with n. 




FIG. B.2: 



In this geometry, clearly loops with zero winding defuse only slowly (while points defuse quickly). 
Since slab\a is homeomorphic a solid torus, it is natural to wonder if the circular coordinate — even 
though not geometrically a Cartesian product — might be interpreted as imaginary time in an exotic 
finite temperature path integral. 



Appendix C: Gropes 

As a final example we complete half gropes H n , to gropes G„ and G+. If caps are present, this 
leads to interesting presentations of the trivial group, associated now to the commutator series (rather 
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than the 1. c. s. of SectionB). The presentation of tti(G^) = {e} has exponentially (in n) many generators 
and relators; however, each generator is the consequence of only linearly many relators, via the many 
half groups H+ C G+. We start with a geometric picture of G^ where some of the caps and even the 




FIG. C.l: 



final surface stages have become too small (in the illustration) to draw carefully. In order to maintain the 
correspondence between area and number of group relations we should actually think of each punctured 
torus T piece of each stage as the same size and shape. In words start with a punctured torus To, glue 
T 00 an d T 01 to its meridian and longitude. Then continue to strings of length n gluing T„_ strin g. This 
produces G n . To produce G+, continue by adding 2™ disk to the meridian and longitudes of the top 
stage. 

The presentations are: 

n 

7Ti(G n ) ={0, 1,00,01, 10, 11, . . . , 11 • • • 1 | each string of length < n (c.l) 
is the commutator of its two extensions} 

7Ti(G+) = {above presentations + the relations : all n-strings are trivial} = {e}, the trivial group. 
Analogously to the results of Section B we have: 

Proposition C.l. Any map of a disk into G+ bounding 7 must pass over at least 2™ caps (counted with 
multiplicity) and therefore have area exponential in n. □ 

Proposition C.2. 7 bounds thin disks A mapping into G+ in the sense that 7 may be homotoped to a 
point along A without ever increasing its length more than linearly in n. 

Like Section A the space with poorly diffusing loops is now simply connected. Unlike Section A the 
"parent group," 7Ti(G„), the group before adding the trivializing relations, is nonabelian. 

A final remark. Looking for impediments to loop diffusion has brought us into the heart of wild topology. 
Consider an infinite grope G^ with geometrically shrinking stages. Take a tapered neighborhood Af(Goo) 
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which becomes thinner out toward the higher stages and complete with the dyadic Cantor set of limit 
point to Goo. This closed neighborhood A/"(Goo) is nothing other than the famous Alexander horned 
ball, the exotic closed complementary region of Alexander's "horned" embedding of the 2-sphere into 
the 3-sphere. 



